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Method and System for Delivering Media Selections through a Network 
Field of the Invention 

5 This invention relates to networked multimedia delivery system. More 

particularly, this invention relates to a media delivery system for delivering media 
selection to a plurality of media client over a hybrid multicast/unicast network. 



10 



Background of the Invention 



In a true Video-on-Demand (VOD) system, users are allowed to view any video 
programs at any time and perform any VCR-like interactive functions, such as fast 
forward/fast rewind, jump forward/jump rewind, slow motion and pause. It can be easily 
achieved by dedicating a channel to each user. However, this method is . very expensive, 
15 inefficient and not scalable. Thus, multicast delivery is regarded as one of the solutions to 
reduce the cost and increase the scalability of a large-scale VOD system. A multicast 
stream can be shared by a large number of users. Unfortunately, it is difficult to 
implement interactive functions for multicast streams. It is a challenging and hot topic in 
recent years as to find out how to satisfy one user's interactive requests without affecting 
20 other users in the same multicast group. 



Many researches have tried to solve this problem. One of the studies is to provide 
limited VCRTlunctions based on the butter size oflhe set-top boxrlnteTairti^^ 
such as fast forward can only be performed by using the frames already stored in the — 
25 buffer. Thus, a large amount of butter is needed in order to get better VCR functions. 
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Furthermore, this technique cannot serve certain interactive functions such as jump 
forward/backward which involve a change in the buffer content. In another study, it is 
proposed that user interactions can be handled by creating a unicast stream. This new 
stream may be held on till the end of the video. It means that all the users may eventually 
5 hold an individual stream rather than share the same multicast stream. Thus, the 
scalability is reduced or many interactive requests may be blocked. Such problems limit 
the usefulness of such systems, particularly in the case when such systems are 
implemented in metropolitan areas having millions of users. 

I o Therefore, it is an object of this invention to resolve at least some of the problems 

at set forth in the prior art. As a minimum, it is an object of this invention to provide the 
public with a useful choice. 



Summary of the invention 



15 



Accordingly, this invention provides a method for delivering media to a plurality 
of media client having a buffer for caching media of a selected media stream within one 
stream interval and processing capability for playing the media in a multicast media 
stream through a network, including the steps of: 
20 - generating plurality of multicast media streams, wherein each multicast 

media stream is repeated at regular stream intervals; 

joining the media client to a selected multicast media stream in response to 
a selection request from the media"client; 

caching the buffer of the media client continuously with unplayed media in 



25 the selected multicast media stream; and 
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caching the selected multicast media streams in at least one interactive 
server, 

such that interactive requests and/or errors in playing the media in the media client 
are handled by the interactive server or the media server. 

5 

It is another aspect of this invention to provide a system for delivering media 
selection to a plurality of media clients having a buffer for caching media of a selected 
media stream within one stream interval and processing capability for playing the media 
in a multicast media stream through a network, including 
10 at least one media server for generating a plurality of multicast media 

streams, wherein each multicast media stream is repeated at regular stream 
intervals, and the media client is joined to a selected multicast media 
stream in response to a selection request from the media client 
at least one interactive server for caching the selected multicast media 
15 stream 

such that interactive requests and/or errors in playing the media in the media client 
are handled by the interactive server or the media server. 
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Brief description of the drawings 

Preferred embodiments of the present invention will now be explained by way of 
example and with reference to the accompany drawings in which: 

Figure 1 shows the overaflTarcStecture ot toe media delivery system of tMs~ 



invention. 
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Figure 2 shows the media stream scheduling of a particularly preferred 
embodiment of this invention generated by a media server. 

Figure 3 shows how the media client mergers back to the multicast media stream 
after an interactive operation is performed. 
5 Figure 4 shows the pause operations, and Figure 5 shows the corresponding 

change of the media client's buffer during a pause operation. 

Figure 6 shows the change in the media client's buffer during a slow motion 
operation- 
Figure 7 shows how to determine the suitable multicast media stream after a fast 

1 o forward operation. 

Figure 8 shows the difference between play-point for fast forward operation and 
normal play back operation. 

Figure 9 shows the difference between play-point for fast rewind operation and 
normal play back operation. 
15 Figure 10 shows how to determine the suitable multicast media stream after a 

jump forward operation. 

Detail Description of Preferred Embodiments 

20 This invention is now described by ways of example with reference to the figures 

in the following sections. List 1 is a part list so that the reference numerals may be easily 
referred to. 

Although the following description refers the media or the media selectionsto "be - 
"25 delivered is video, it is expressly understood that media in other iorms may also be 
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delivered in the system of this invention instead of video, for example audio, or their 
combination. 

1 . System Architecture 
5 1.1 Overview 

The overall system architecture of the media delivery system (10) of this invention 
is shown in Fig. 1 . The system is composed of four main components: 

a) at least one media server . The media server may be a stand alone server or 
can be a member of the Video Server Cluster VSC (12) as shown in Fig.l ; 
10 b) a plurality of media clients (14) being Client Stations CS; 

c) a network (16) which may be represented as a Multicast Backbone Network 
MBN (20) and a plurality of Local Distribution Networks LDN (22); and 

d) at least one interactive server (18) being the Distributed Interactive Server 
(DIS). 

15 Note that the MBN (20) can use any arbitrary topology that supports the multicast 

protocol. The ring structure shown in Fig.l is used for simplicity and should not be 
interpreted that a token-ring network is required for this invention. 

1 .2 Video Server Cluster VSC H2) . 
20 The role of the VSC (12) is to generate the multicast media streams for the entire 

system. Each VSC (12) consists of at least one and preferably 5 to 15 media servers. The 
number of media servers in the cluster may be altered as desired. 

Preferably, each media server stores part of the video contentln^^iffipi^^t^ 
format with parity added for forward error correction, like RAID 5. This allows a simple— 
25 error recovery it the US (14) missed out one video block out of the parity group. Also, 
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the entire VSC (12) can still be operational even when some of the media servers are 
down, achieving some degrees of fault-tolerance. Other known stripping scheme may be 
employed in this invention. 

The video blocks sent by the VSC (12) are preferably interleaved randomly to 
5 minimize the impact of burst block errors and increase system security. Since blocks are 
most likely to drop in a busty manner, by interleaving the packets sent by the VSC (12), 
missing packets are scattered more evenly. Hence, the simple parity scheme may be able 
to recover most, if not all of the missing packets 

In addition, block interleaving discourages eavesdroppers to capture the video 
1 0 blocks for viewing. A pseudo random sequence may be used for generating the random 
interleaving: the generating key of the sequence is passed to the CS (14) by a public key 
encryption algorithm during channel establishment. As a result, the CS (14) can reorder 
the video sequence from the regenerated pseudo random sequence. 

To provide interactive functions to the user, the multicast media streams are 
15 repetitively started at fixed regular stream intervals, for example, every thirty to sixty 
seconds, at the VSC (14) to serve a plurality of media streams to the majority of 
customers not performing any interactive functions. The media stream scheduling of 
thirty seconds stream interval is shown in Fig.2. 

The stream interval may be chosen at any desired value basing on the scale and 
20 the performance of the system (10). However, the stream interval is preferably set at 
around 30 - 60 sec, such that the average start up time may be around 15-30 sec, which 
is acceptable. Although a large number of multicast media streams are needed, it is 
worthwhile to do so as it can provid^aljetter quality~uf^Tvi^^^ 

the buffer requirement at the client side for fully interactive functions. Moreover,- with- - 
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more multicast media streams, the number and the holding time of the unicast streams 
required for providing interactivity and merging may be reduced. 

1 .3 Client Station CS (14) 
5 Each CS (14) should have a buffer that can hold media contained in the multicast 

media streams for up to one stream interval of video blocks. For stream interval equals to 
30 sec and MPEG-2 video (2 to 4 Mb/s), that amounts to -8-15 MB. With a simple parity 
for error correction, such as 10 servers with 1 as parity, the buffer required is around 15* 
10/9 or 16.7 MB. Therefore 32MB of buffer for each CS appears to be sufficient. 
10 Other than memory requirement, each CS (14) should have a network connection 

such that the media streams can by delivered to CS (14). The network connection is 
preferably broadband network connection which can allow -1.5 to 2 times of the MPEG- 
2 transmission rate. Furthermore, each CS (14) should have enough processing capability 
for playing the media in the multicast media stream. A low-end equipped with Pentium- 
1 5 266 together with a hardware or software MPEG-2 decoder is found to be satisfactory. 

1.4 Network (1 6) 

1 .4. 1 . Multicast Backbone Network MBN (20) 

There is no particular requirement on the underlying network (16), other than that 
20 it should be capable of supplying enough bandwidth for delivering the multicast media 
streams to CS (14). In a real-life application, MBN (20) may be responsible for handling 
several thousands of multicast streams for distribution to the CS (14) through the LBN 

(? 21 : 
Preferably, the MBN (20) is connected to the LDN (22) by a high-speed router:— 
25 Each router should be capable of running the desired multicast routing proxocol such as 
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PIM, MOSPF, DVMRP etc. Ideally, the MBN (20) should be fault-tolerant and can be re- 
routed to alternate routes when necessary. The current IP over DWDM network proposals 
may be used as they seem to provide such a desirable characteristic. In general, the higher 
the bandwidth of the backbone network, the more media streams it can serve to the users. 

5 

1.4.2 Local Distribution Networks LDN (22) 

The LDN (22) carries the multicast media streams down to each CS (14), pruning 
the streams down the way whenever they are not needed. A simple tree network may be 
sufficient for such puipose. 

10 

1.5. Distributed Interactive Server PIS (IS) 

The DIS (18) are mainly responsible for error recovery and generating unicast 
contents in response to interactive requests submitted by CS (14), by caching the 
multicast media streams. Although these functions may be performed by VSC (12), they 
1 5 are preferably performed by DIS (1 8) to reduce overall server and network load. 

Since each multicast stream provided by VSC can be viewed by virtually 
unlimited number of users, while unicast stream for interactive functions is not, a 
distributed approach is chosen so that the system is more scalable. 

One of the functions of the DIS (18) is handle errors in playing the media in CS 
20 (14), including transmitting any video blocks that the CS (14) has not received. When the 
CS (14) failed to reconstruct the missing video block, it sends a request to the DIS for the 
missing video block. 

As the I) IS~CT8 ) is distributed "anai t is closer to ffieCSXt 7 ^^^ 5 ^ 
be minimized and this may improve the response time of the interactive functions and the— - 
25 success rate of retransmission. The multicast stream provides most of the traffic. It was 
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found in related studies that less than 2% of users perform interactive functions at one 
time. Therefore, a low-end DIS (18) may be sufficient for the media delivery system 
(10). 

5 2. Service Provisioning 

The architecture of media delivery system (10) may provide three distinct services 
Class A, B, and C unified in one single framework. 

Class A service is similar to the current cable TV service. Users can watch any 
broadcast channels in a non-interactive manner. To support Class A service, the media 
10 delivery system (10) should be capable of supporting hundreds of non-interactive 
multicast channels. This is provided (as also offered in many other architectures) by 
standard IP multicast channels over the broadband infrastructure. The key issue to be 
resolved is the handling of error retransmission. In this context, Class A service is 
provided by the VSC (12) using multicast IP streams and distributed to the CS (14) via 
15 the network (16). Error recovery and retransmission may be handled by the DIS (18) at 
each LDN to improve error retransmission. 

The Class B service aims at supporting limited media (say hundreds of hours of 
MPEG-2 programs) in a fully interactive manner with high user scalability. This is 
provided by the hybrid multicast-unicast streaming technology of this invention that a 
20 modest amount of system resources may be required to support a large number of users. 
This service is particularly desirable for metropolitan areas where millions of users may 
want to see certain media (such as movie, sports or musical event) without little or even 

no restriction. In the medla^elly-ery^tem^^^ 

of video contents (MPEG-2) can be supported cost effectively for interactive multicast 

25 distribution currently. 
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Class B service is the most complicated and will be explained in the sections 

below. 

The Class C service aims at offering unlimited content to a fixed number of users. 
This service concept is similar to many existing offerings based on a unique unicast 
5 stream for each viewer. The service unfortunately is unscalable in the sense that the 
service provisioning cost is fixed per user and is also fairly high at the current technology 
level. However, when this service is bundled with the previous two DINA services, there 
may be only a small percentage of the entire viewers requesting for this service, thus the 
overall system cost may be reduced drastically. 
10 Class C service is handled in the following manner. When a certain CS (14) 

requests for a Class C service, a dedicated interactive request is submitted by the CS (14) 
asking for a dedicated media. The local DIS (18) will first be checked to see if there 
exists a copy of the dedicated media stored at the local cache of DIS (18). If yes, the local 
DIS (18) will service the request directly by starting a unicast stream. If not, the user 
15 manager will initiate a request to the VSC (12). The VSC (12) will distribute the content 
via a unicast stream to the CS (14) requesting the service, either directly from the VSC 
(12) or indirectly to the DIS (18). Interactivity is handled by the VSC (12) or the DIS 
(18). The cache implemented at the local DIS (18) aims at reducing the number of 
requests to the VSC and the backbone bandwidth required according to the usage 
20 statistics of the media. 

3. Interactive Functions 

Interactive junctions, including tast torward, fast rewind, jump forward, and jump 
backward, for the CS (14) may be performed with a dedicated unicast stream" from™the" — 
25 DIS. Such interactive functions are provided m the Class & service. 
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In general, when the CS (14) requests an interactive function, the CS (14) will first 
leave the multicast group it currently belongs to, then request a unicast stream to handle 
the interactive functions. When the CS (14) finishes the interactive function, it first uses 
the unicast stream for normal playback, but at a higher, say -1.5 to 2X, pump rate. As a 
5 result, the CS's (14) buffer will keep filling up and it will be full after one or two time 
slots. At that point the CS (14) will close the unicast stream to rejoin a suitable multicast 
steam. Although the unicast stream may be left open, this will increases the network 
load. 

In order to provide a limited amount of media to an unlimited number of users in a 
10 fully interactive manner, the batching concept is utilized in this invention, so that the 
users may share the same video stream at the same time while interactive functions may 
still be performed. This may allow one multicast stream to serve many users and reduce 
both the server and network. 

Three types of streams are defined in Class B service: 1) M(i)-stream - which 
15 stands for multicast media stream for normal play starting at the beginning of the i-th 
stream interval; 2) I-stream - which stands for interactive unicast stream opened for the 
interactive functions requested by any user; 3) J-stream - which stands for merging 
unicast stream used by a user to merge back to the M(i)-stream. 

When a user engages in an interactive function, a new unicast stream may be 
20 provided to the user according to what interactive request is raised. Thus an unicast 
stream is said to have split from the original multicast stream. The splitting operation is 
straightforward as a new unicast stream is provided to handle the requested interaction. 
05TffieT)1fi5rilani^^ — 



important as it allows a client on a unicast stream to merge back to the M(i)-stream -and- 
"23 release the 1-stream after performingThe interactive ftoctirmT-A great iwpiovewent in thtr 
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VOD interactive feature may be achieved because the merging operation reduces the 
number of unicast streams. It in turns allows the same number of streams to serve more 
users' interactive requests, thus better quality of service and scalability may be achieved. 
The architecture of the media delivery system (10) can ensure that all the clients 
5 can merge back to M(i)-stream provided that the following requirements are met: 1) the 
CS (14) buffer is large enough to hold all the frames within one stream interval (say 30 - 
60 sec), 2) the J-stream can transmit at a higher rate than the M(i) streams and therefore 
can fill up the necessary buffer before merging. 

As an example, consider a 30-sec MPEG-2 (4.7Mbps) video, the buffer required is 
10 18MB (30*4.7/8). The J-stream is opened as soon as a user has finished all interactive _ 
functions and returns to normal play. The J-stream then transmits frames at a faster rate f. 
Since the incoming data rate is faster than the consumption rate r, the buffer, will fill. up. f 
can be chosen with different values according to the network architecture. Network with 
more bandwidth can support a larger f, which means that the client can merge back to the 

1 5 M(i)-stream in a shorter time. 

Assume the buffer is being filled up at a rate of (f-r) Mbps. In the worst case, the 

required time Tfm to fill up the buffer is, 

_ buffersizex% 

T P .„= sec 

F,ii f _ r 

For an example, if f = 1 .5 *4.7Mbps, then T Ffll = 60 sec . 
20 Once the buffer is filled up, the client must be capable of merging back to a M(i)- 

stream, which is shown in Fig.3. Suppose that after some interactive action and J-stream 



transimssTonj 



CS (14). The current play-point of the stream is at 90sec so the CS (14) buffer stores ffie" 
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frames from 90sec to 120sec of the stream. The CS (14) then leaves the J-stream as no 
more data can be stored, thus freeing up a J-stream to serve other users. 

Since the stream interval between the M(i)-streams is the same as the CS buffer 
size, an M(i)-stream with the current play-time that is within the CS buffer time mark can 
5 always be found. Since CS will have to continue to play frames beyond what have been 
buffered, it can merge back to the appropriate M(k)-stream. By doing so, it can start to 
receive frames at 120sec of the M(k)-stream (i.e. at time mark 290sec relative to the CS). 
At that time, 10 seconds of buffer space in CS can be freed as the frames from 90sec to 
lOOsec have already been played. Thus, this client is successfully merged back to the 
10 shared M(k)-stream. 

The operations of each interactive function that may be preferred to be performed 
in the media delivery system (10) of this invention will now be described in the following 
sections. 

15 Plav/Stop 

For normal play back, the CS (14) first sends a play request to VSC (14), then 
joins the multicast media stream and finally wait for VSC video data. While for stop, CS 
just simply tells the VSC and leaves the selected multicast media stream. During the play 
time, the CS buffer is continuously filled by the media in the selected multicast media 
20 stream. 

To improve the benefit of multicast delivery, the VSC (12) preferably waits for 
some time to fill the buffer of the CS (14) before starting a new selected multicast media 
stream when a selection request is raised~6y r ihe user. 



25 Pause 
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Pause keeps the play-point at its current position. During Pause, the CS buffer 
continues to receive data from the M(i)-stream, while no data is consumed. Thus, data 
will accumulate at the buffer. If normal -play is resumed before the CS buffer is full, the 
5 CS (14) can continue to receive data from the same M(i)-stream. Only the play-point 
position in the buffer is changed. If Pause continues until buffer is full, the CS (14) does 
nothing after the buffer is filled up. It keeps the frames in the buffer for merging. Once 
Resume is activated, it will try to find the appropriate M(i)-stream to merge. The merge 
operation is the same as what has been described in Section C. Suppose the original 
1 0 stream is M(k) and the Pause time is T Pause . The algorithm is as follows: 

If mx stream interval < T Pause 
< (m+ l)x stream interval, 
then merge to M(k+ m) stream 

where m should normally be positive as the CS (14) rejoins the later multicast streams for 
it to maintain the same position in the video. When the play-point is paused near the end 
of the stream and the pause time is large, there may be a wrap around of the streams and 
15 m can take on negative values. 



SlowMotion 

Slow Motion is to play a stream at a slower speed, e.g. 0.5X. During the slow 
motion, data consumption rate is smaller than the arrival rate. Thus data will be 
20 accumulated in the buffer. Similar to pause, if playback is resumed before the CS buffer is 

fatt-tfac cs (14) c an c eatinue-tQ^eceive-data from t h at MQstream. If slow motion 

continues until the buffer is Ml, the CS (14.) must leave the current M(i)-sfream. CS_Q4) 

^TH-corrtmtre-to-pl ay slow motion up t o the end of the buffer . Then CS (14) needs to join 
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the next stream in order to get the required frame for continuing the slow motion. It is 
also necessary for the CS (14) to join the next stream so it can resume normal play at any 
time. 

The CS (14) buffer state shown in Figure 6 helps to explain how slow motion 
5 works, which refer to a specific example of slow motion operation. The time mark is 
relative to the CS (14). In Figure 4, slow motion at 0.5X begins at CS 30sec. Afterwards, 
frames of 5 seconds are played in every 10 seconds of CS-time. However, the incoming 
frame rate is unchanged. Thus, frames of 5 seconds are accumulated in every 10 seconds 
of CS time. The buffer will be full at 80sec and the CS (14) must leave the current M(k)- 
10 stream. Then the CS (14) joins the next M(k+l)-stream to get the missing frames after 
80sec. Since every stream is separated by the same stream interval 30sec, the frames after 
80sec from M(k+l)-stream will be available at CS HOsec. The frames received before CS 
llOsec duplicate with those in the buffer and will be discarded. At CS 120 sec, the CS 
(14) resumes normal play. The play-point position will change to the new M(k+l)-stream 
15 because the old frames have already been played out and the CS (14) resumes normal 
play with the incoming frames from the new M(k+l)-stream. 

Various Speed Fast Forward /Fast Rewind (FF/REW) 

Fast Forward FF or Fast Rewind REW is to play frames faster than the normal 
20 speed. In operation, the CS (14) first tries to use the frames in its own buffer to serve FF 
by skipping some of the frames. If the FF action exceeds the range of the frames in buffer, 
video provided by the I-stream, which is pre-recorded at different speeds for both forward 
and reverse direction FF/ RE W, is uti lized. TKs~scheme norrmlynises^a^wiTith-m^re— 
efficiently, but also provides various speeds for FF/REW actions (e.g. IX for rewind, -2X~- 
25 4X, etc) which are ottered in advanced VUK. 



15 



P/D:9817836 



PCT/IB00/01857 



The I-streams containing the prerecorded media are generated and provided by the 
DIS (18) to the CS (14). For example, when a user requests a 4x FF, the DIS sends the 
pre-record 4X forward I-stream containing video starting at the required time to the CS 
(14). CS (14) may then play out the frames without wasting any bandwidth. When the 
5 CS ends the interactive function and resumes the normal play, all of the CS buffer is 
cleared as they are no longer valid. Then, a J-stream is sent by the DIS (18) to transmit 
data at a faster rate to the CS (14) to fill up the buffer for the merging operation as 
mentioned in previous section. 

In order to know which packet the J-stream should carry in order to match the 
10 play-time, the required packet sequence number p should be set equal to (play-time x 
transmission rate of M(i)-stream)/x, where x is the packet size in bit. 

Figure 7 shows how to determine the suitable M(i)-stream after FE It can be 
realized that the actual play-time for, say, a 20-seconds 4X FF is 80 seconds. Tff is the 
time for FF action and T F iu is the same as defined previously. The play-time has gone for 
15 (Pmc - Pff), where Pff is the play-time to begin FF and P M c is the play-time to resume the 
normal multicast M(i)-stream. (Tff + T Fi ii ) is the total time for the split and merge 
operation. Thus, the stream has been played for a time (Tff + T F ni ). In Figure 8, which 
shows the difference between play-point for FF operation and normal play back 
operation, it is shown that [ (P MC - ) - (T^ + T^ ) ] is the play-time of the new multicast 
20 stream ahead of the play-time of the original multicast stream. Suppose the CS is 
originally in the M(k)-stream. The algorithm for FF operation is as follows: 

If '/ox stream interval < (P MC - P FF )- (T& + T Fm ) 
< (nT+7 )x stream interval 
then merge to M(k - m) stream 

where m should be positive as it needs to go to toe previous stream in order xo get to the 
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10 



later part of the viewing. 

For REW, the operation is similar to FF. It will bring the play-time backwards. 
The DIS will send a pre-record 1X/2X/4X reverse I-stream to CS and J-stream is also 
used to fill-up the buffer for merging. However, referring to Fig. 9, now [Tff + T Fi ii + 
(Prew - Pmc)] is the play-time behind the current position. Prew is the time to start REW. 
The algorithm for REW operation is as follows: 

If mx stream interval < 7^, + T m + (Prew ~ P MC ) 
< (m+ 1 )y. stream interval 
then merge to M(k+ m) stream 

where m should be positive in order to go to later stream for the earlier part of the 
viewing. 



Jum p Forward/Jump Backward fJF/JB ) 

Jump Forward is to go to a specific play time immediately. It is an advanced 
feature in VCD and DVD players which allows user to search frames by directly going to 

15 that play-time. . . 

When a user issues a JF request in the media delivery system (10) of this 
invention, it will first determine whether the target time frame is in the CS buffer. If yes, 
the user can be served by just moving the play-point position in the buffer to the required 
frames. If no, a J-stream beginning at the required time will be sent immediately from the 
20 DIS. The CS clears (CS) its own buffer and plays frames from J-stream. It accepts the J- 

stream until the buffer is full, then leaves the J-stream and merges back to a M(i)-stream. 
Figm e 1 0 shows-a-partieular-exafflp 



time mark to the 130sec time mark. Pjf is the time when JF starts. Just like the..ather_. 
interactive functions, the CS needs to fi nd ti ie-stntable-M(i)-sfee am for merging backJhe— 
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algorithm is similar to FF: 

If zo x stream interval < (P MC - )- T FjV 
< (m+ 1 Jx stream interval 
then merge to M(k-m) stream 

where m should be positive as it requests the later part of the viewing by jumping back 

to the previous stream. 

5 The JB operation is similar to IF, except that JB will jump to a play-time at the 

earlier part of the viewing. 

If mx stream interval < T FiU + (P m - P MC ) 
< ( m+ IJx stream interval 
then merge to M(k+ m) stream 

where m should be positive as it requests the earlier part of the viewing by 
jumping to the later stream. 

10 

It has long been a difficult problem to provide fully interactive functions in a 
multicast VOD system. The media delivery system (10) of this invention may allow the 
user to perform fully interactive functions, including pause, slow motion, fast 
forward/rewind, jump forward, and jump rewind which require a relatively low system 
15 resources to function. This may be achieved by limiting the number of unicast media 
streams during the operation of the interactive functions. Therefore, the overall costs of 
ownership of such VOD systems providing interactive functions may be reduced. 

W hile the prefeiTed ^mlxxuTOent uf the present-iixventien-has-been-deseri-bed-^B- 

20 detail by the examples, it is apparent that modifications and adaptations of the present 
^.Bventiofl-w*l^oeeu^to--fo^^ understood, however , _ 
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that such modifications and adaptations are within the scope of the present invention, as 
set forth in the following claims. Furthermore, the embodiments of the present invention 
shall not be interpreted to be restricted by the examples or figures only. 
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CLAIMS: 

1. A method for delivering media to a plurality of media client having a buffer for 
caching media of a selected media stream within one stream interval and 
5 processing capability for playing the media in a multicast media stream through a 

network, including the steps of: 

generating plurality of multicast media streams, wherein each multicast 
media stream is repeated at regular stream intervals; 

joining the media client to a selected multicast media stream in response to 
10 a selection request from the media client; 

caching the buffer of the media client continuously with unplayed media in 
the selected multicast media stream; and 

caching the selected multicast media streams in at least one interactive 
server, 

15. such that interactive requests and/or errors in playing the media in the media client 

are handled by the interactive server or the media server. . m 



2. The method of Claim L wherein the interactive requests and the errors in playing 
the media in the media client are handled by the interactive server. 

20 

3. The method of Claim 1, wherein the media in each multicast media streams- are 
sent in packets of data, and the packets are interleaved randomly. 



4. The method of Claim 1, wherein the stream interval is 30 to 60 seconds. 



25 



21 



10 



A 0 PCT/IB00/01 857 

P/D:98 17836 



5. The method of Claim 1 further including the step of generating a dedicated unicast 
media stream from the media server or the interactive server and delivering to the 
media client in response to a dedicated interactive request from the media client 
requesting a dedicated media. 

6. The method of Claim 5, wherein the dedicated unicast media stream is generated 
from the interactive server if the interactive server contains the dedicated media. 

7. The method of Claim 5, wherein the dedicated unicast media stream is generated 
from the media server if the interactive server does not contain the dedicated 
media. 



8. The method of Claim 5, wherein the dedicated unicast media stream is generated 
from the interactive server after the dedicated media is delivered from the media 

15 server to the interactive server, if the interactive server does not contain the 

dedicated media. 

9. The method of Claim 1, wherein the interactive request includes any one or more 
of pause, slow motion, fast forward, rewind, jump forward, and jump backward. 



20 

10. 



A system for delivering media selection to a plurality of media clients having a 
buffer for caching media of a selected media stream within one stream interval 
_and^ocessing_caR ability for playing the media m a m ulticast media stream 



through a network, including 
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at least one media server for generating a plurality of multicast media 
streams, wherein each multicast media stream is repeated at regular stream 
intervals, and the media client is joined to a selected multicast media 
stream in response to a selection request from the media client 
at least one interactive server for caching the selected multicast media 
stream 

such that interactive requests and/or errors in playing the media in the media client 
are handled by the interactive server or the media server. 

The system of Claim 10, wherein the interactive requests and the errors in playing 
the media in the media client are handled by the interactive server. 

The system of Claim 1 0, wherein the media in each multicast media streams are 
sent in packets of data, and the packets are interleaved randomly. 

The system of Claim 10, wherein the stream interval is 30 to 60 seconds. 

The system of Claim 10, wherein a dedicated unicast media stream is generated 
from the media server or the interactive server and delivered to the media client in 
response to a dedicated interactive request from the media client requesting a 
dedicated media. 

15. The s v$tem~orCIaIm"l 4 , wherein toe dedicated unicast media stream is generated 
from the interactive server if the interactive server contains the dedicated media; 

25 
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1 6. The system of Claim 14, wherein the dedicated unicast media stream is generated 
from the media server if the interactive server does not contain the dedicated 
media. 

17. The system of Claim 14, wherein the dedicated unicast media stream is generated 
from the interactive server after the dedicated media is delivered from the media 
server to the interactive server, if the interactive server does not contain the 
dedicated media. 

1 8. The system of Claim 1 0, wherein the interactive request includes any one or more 
of pause, slow motion, fast forward, rewind, jump forward, and jump backward. 
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Abstract 

In a largescale video-on-demand (VOD) system, the scalability and the provision of 
truly interactive functions are two difficult problems which have not been resolved 
satisfactorily. It is easy to provide fully interactive functions using unicast streams but 
5 these systems are limited in their scalability which affect the cost of service provisioning. 
Batching systems based on multicast streaming, on the other hand, can increase the 
scalability but it is difficult to provide interactive functions on these systems. This 
invention provides a media delivery system having a novel architecture aiming at serving 
millions of users in a metropolitan area. It features hybrid multicast-unicast streaming to 
10 achieve both scalability and full interactivity through the provision of distributed 
interactive servers, which cached the multicast media streams generated by the media 
servers. The interactive servers may also provide fault tolerant routing and error recovery. 
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